Melody Extraction from Polyphonic Audio Based on Particle Filter
Abstract
This paper presents a particle-filter-based algorithm for extracting the melody from polyphonic audio in the short-time Fourier transform (STFT) domain. The algorithm targets three difficulties: interference from harmonic and percussive sounds, the possibility of octave mismatch, and dynamic variation in the melody. Its main idea is to model the probabilistic relations between the melody and the polyphonic audio. The melody is assumed to follow a Markov process, and the framed segments of polyphonic audio are assumed to be conditionally independent given the parameters that represent the melody. The melody parameters are estimated using sequential importance sampling (SIS), a conventional particle filter method. The likelihood and the state transition are defined so as to overcome the difficulties listed above. The SIS algorithm relies on a sequential importance density, which is designed from multiple pitches estimated by a simple multi-pitch extraction algorithm. Experimental results show that the algorithm outperforms other well-known melody extraction algorithms in terms of raw pitch accuracy (RPA) and raw chroma accuracy (RCA).
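The estimation loop described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's method: the abstract does not give the likelihood, the state transition, or the multi-pitch estimator, so the harmonic-salience likelihood, the random-walk transition with a uniform floor, the peak-picking candidate function, and every parameter value below are hypothetical stand-ins. The resampling step is also an addition that plain SIS does not include.

```python
import numpy as np

LO, HI = 48.0, 84.0  # assumed melody search range, in MIDI note numbers


def gauss(x, mu, sigma):
    """Gaussian density, used for the toy transition and proposal."""
    return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2.0 * np.pi))


def salience(pitch_midi, spec, df, n_harm=5, decay=0.8):
    """Weighted sum of spectral magnitude at the first n_harm harmonics."""
    hz = 440.0 * 2.0 ** ((np.asarray(pitch_midi) - 69.0) / 12.0)
    harm = np.arange(1, n_harm + 1)
    bins = np.clip(np.rint(np.outer(hz, harm) / df).astype(int), 0, spec.size - 1)
    return spec[bins] @ decay ** (harm - 1)


def pitch_candidates(spec, df, k=3, step=0.25):
    """Stand-in multi-pitch estimator: the k strongest salience peaks."""
    grid = np.arange(LO, HI, step)
    sal = salience(grid, spec, df)
    peaks = np.flatnonzero((sal[1:-1] > sal[:-2]) & (sal[1:-1] >= sal[2:])) + 1
    if peaks.size == 0:
        peaks = np.array([sal.argmax()])
    return grid[peaks[np.argsort(sal[peaks])[-k:]]]


def sis_step(particles, weights, spec, df, rng, mix=0.5, eps=0.05,
             sigma=0.5, sharpness=8.0):
    """One SIS update: propose, then reweight by likelihood * transition / proposal."""
    n = particles.size
    cands = pitch_candidates(spec, df)
    # Proposal q: with probability `mix` jump near a candidate, else random-walk.
    from_cand = rng.random(n) < mix
    centers = np.where(from_cand, rng.choice(cands, size=n), particles)
    new = centers + sigma * rng.standard_normal(n)
    q = (mix * gauss(new[:, None], cands[None, :], sigma).mean(axis=1)
         + (1.0 - mix) * gauss(new, particles, sigma))
    # Transition p(x_t | x_{t-1}): random walk plus a uniform floor for pitch jumps.
    trans = (1.0 - eps) * gauss(new, particles, sigma) + eps / (HI - LO)
    # Likelihood p(y_t | x_t): sharpened, normalised harmonic salience (assumption).
    lik = (salience(new, spec, df) / (spec.max() + 1e-12)) ** sharpness + 1e-300
    w = weights * lik * trans / q
    return new, w / w.sum()


def track_melody(frames, df, n=300, seed=0):
    """Run the filter over magnitude frames; return one pitch estimate per frame."""
    rng = np.random.default_rng(seed)
    particles = rng.uniform(LO, HI, n)
    weights = np.full(n, 1.0 / n)
    track = []
    for t, spec in enumerate(frames):
        # First frame has no predecessor: flat transition, candidate-only proposal.
        kw = dict(mix=1.0, eps=1.0) if t == 0 else {}
        particles, weights = sis_step(particles, weights, spec, df, rng, **kw)
        track.append(np.sum(weights * particles))  # weighted-mean pitch estimate
        # Resampling is NOT part of plain SIS; added here to limit weight degeneracy.
        if 1.0 / np.sum(weights ** 2) < n / 2:
            particles = particles[rng.choice(n, size=n, p=weights)]
            weights = np.full(n, 1.0 / n)
    return np.array(track)


def demo(n_frames=100, seed=1):
    """Synthetic check: gliding melody plus a weaker fixed note and noise."""
    rng = np.random.default_rng(seed)
    freqs = np.arange(513) * (8000.0 / 1024)  # bins of a 1024-point STFT at 8 kHz
    f0 = np.linspace(300.0, 360.0, n_frames)
    frames = []
    for f in f0:
        spec = 0.05 * np.abs(rng.standard_normal(freqs.size))
        for note, amp in ((f, 1.0), (131.0, 0.3)):
            for h in range(1, 6):
                spec += amp / h * np.exp(-0.5 * ((freqs - h * note) / 12.0) ** 2)
        frames.append(spec)
    est = track_melody(np.array(frames), df=freqs[1])
    truth = 69.0 + 12.0 * np.log2(f0 / 440.0)
    return float(np.median(np.abs(est - truth)))  # median error in semitones


if __name__ == "__main__":
    print("median pitch error (semitones):", demo())
```

Building the proposal from pitch candidates lets particles reach a new pitch quickly after a jump, while the transition term in the weight still penalises implausible moves such as octave errors. On real audio, `frames` would be the magnitude of an STFT, and the toy likelihood would need to be replaced by one that accounts for percussive and harmonic interference.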
Similar resources
Melody Extraction by Means of a Source-filter Model and Pitch Contour Characterization (MIREX 2015)
This abstract presents our submission to the MIREX 2015 melody extraction task, whose goal is the identification of the melody pitch sequence from polyphonic musical audio. Our approach combines a source-filter model with the characterisation and analysis of pitch contours. The proposed method obtained the highest overall accuracy on several datasets among the algorithms participating in this y...
Mid-Level Music Melody Representation of Polyphonic Audio for Query-by-Humming System
Recently, considerable attention has been paid to content-based multimedia retrieval, which enables users to find and locate audio-visual materials according to the intrinsic characteristics of the target. Query-by-humming (QBH) is one such application: it retrieves music based on a major characteristic of music, namely its melody. There has been some research on QBH systems, most of which are to retrieve m...
Melody pitch estimation based on range estimation and candidate extraction using harmonic structure model
This paper proposes an algorithm to estimate the melody pitch line (the most dominant pitch sequence) of a given polyphonic audio based on melody range estimation and pitch candidate extraction using a harmonic structure model similar to that proposed by Goto. This paper defines melody pitch candidate as a list of pitch candidates that produces the best-fit harmonic models to the polyphonic aud...
Melody Extraction from Polyphonic Audio Signal MIREX 2009
This paper describes the proposed algorithm submitted to the MIREX 2009 “Audio Melody Extraction” task. The algorithm addresses the task of extracting the predominant melody pitch from a polyphonic audio signal. The algorithm extracts the melody pitch in three steps. In the first step, transient analysis is performed on the polyphonic audio signal to determine the analysis frame length, and the...
An Auditory Streaming Approach for Melody Extraction from Polyphonic Music
This paper proposes an efficient approach for identifying the predominant voice in polyphonic musical audio. The algorithm implements an auditory streaming model that builds on tone objects and salient pitches. Voices are formed by regularly updating the frequency and magnitude of so-called streaming agents, which aim at salient tones or pitches close to the...